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Abstract 

Given  an  i.i.d.  sequence  of  random  variables  (r.v.'s)  with  continuous 
cumulative  distribution  function  (CDF)  F,  we  present  ,a  simple  construction 
for  the  jump  times  of  an  extremal  process  on  the  same  probability  space 
which  'interpolate'  the  given  record  times.  This  gives  another  approach  to 
the  strong  approximation  of  extremal  processes  as  developed  by  Deheuvels 
(1981,  1982,  1983),  and  allows  for  a  more  detailed  investigation  of  the  re¬ 
lationship  between  the  record  times  of  the  given  sequence  and  the  jump  times 
of  the  extremal  process.  In  particular,  it  it  shown  that  the  number  S  of 
surplus  jump  time  points  in  (1,*)  over  the  record  times  is  approximately 
Poisson  distributed  with  an  exact  mean  of  E(S)  =  1  -  C,  C  denoting  Euler's 
constant . 

Keywords :  Extremal  process,  record  times,  strong  approximation,  Poisson  ap¬ 
prox  imat ion . 
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1.  Introduction 


Let  {X  ;  n-  1}  be  an  i.i.d.  sequence  of  r.v.'s  with  continuous  CDF  F,  and 
n 

let  X,  .=  max{X,  , . . . ,  X  },  n  >  1.  Of  particular  interest  are  the  tiroes  U  when 
(n)  1  n  n 

these  partial  maxima  change  their  values,  defined  by 

UQ=1,  Un+1=  inf{k,;xt.  >X?J  },  n>0.  (1.1) 

'  n 


Due  to  the  continuity  of  F,  (1.1)  is  a.s.  vell-def intd ;  l'  is  also  called  the 

n 

n^  record  time,  and  X^t  the  n^  record  value  of  the  sequence.  Several  efforts 

n 

have  been  made  to  clarify  the  asymptotic  properties  of  record  times,  using 
different  approaches  such  as  canonical  representations  ([12], [13]),  strong  ap¬ 
proximation  techniques  ([3], [6])  or  embedding  into  extremal  processes  ([7], [9]), 

all  of  them  saving  that  {logU  ;  nJ  1}  asvmptot icallv  behaves  like  a  homogeneous 

n  ' 

Poisson  point  process  with  unit  rate. 

Here  the  extremal  process  (E(t);  t > 0)  (called  extremal  -  F)  is  a  right 

continuous  non-decreasing  pure  jump  Markov  process  such  that  for  all  selections 

0<  t  <  ...  <  t,  of  time  points  we  have 
I  k 


k  t  k 

P(  i  {E(t.)<x.})  =  F  1(min(x ....  x  })  Ti  F 
•  i  i  i  X  n  ,  a 

i=l  i=2 


tk"tk-l 


(min{x^, . . 


(1.2) 


where  x^,...,  x^  t  H.  Especially,  from  (1.2)  it  follows  that  we  have 

{X, X  }  =  {E(l),...,  E(n)  i  for  all  nil,  where  =  means  eaualitv  in 
UJ  (n) 

distribution.  The  structural  properties  of  such  extremal  processes  are  well- 

investigated  (cf.  [7]— [10] ) ,  and  their  importance  is  given  by  the  fact  that 

they  occur  as  the  functional  weak  limits  of  the  normalized  processes 

i\~a  );  t  >  0}  ([•]  denoting  integer  part)  where  a  elR,  b  >0  are 
b  (intj;  n  n  n 


constants  such  that  r— (X,  .  -  a  )  tends  weakly  to  an  extreme  value  distribution 

b  fnj  n 


n 


(if  applicable)  (see  e.g.  [9]  and  further  references  therein).  Further,  if 


-oo  <  n  <  00}  denotes  the  jump  times  of  the  extremal  -  F  process,  it  has  been 


shown  that  these  form  a  non-homogeneous  Poisson  point  process  with  intensity 
A(t)  =  l/t,  t>0  (in  fact,  the  extremal  process  has  infinitely  many  jumps  in 
every  neighborhood  of  the  origin).  Correspondingly,  the  sequence 


(E(T  );  -°°<  n<°°}  of  states  visited  forms  a  Markov  chain  with  transition 
n 


probabilities 


P(E(Tn+l}  >  y l E(Tn)  =  x) 


-  log  F(y) 

-  log  F (x) ’ 


y  2  x 


(1. 


where  x,v  are  chosen  such  that  0 <  F(x)  £  F(y)  <  1.  Since  the  distribution  of 

{t^}  is  independent  of  F,  (E(t)}  can  be  transformed  to  an  extremal  -  A  process 

(E*(t) }  by  letting  E*("n)  =  -  log  {-  log  F(E(Tn) ) ) } ,  t >  0,  and  interpolating  with 

piecewise  constant  paths,  where  A(x)*expfce  X),  x£  E  is  the  CDF  of  a  doubly 

exponential  distribution.  Then  (E*(t  )}  forms  a  homogeneous  Poisson  process 

n 

on  1R  with  unit  rate.  It  follows  that  the  time- transformed  process 
{E*(et);  t c  H  }  now  is  homogeneous  Poisson  both  in  time  and  space. 

In  the  light  of  (1.2),  one  might  ask  whether  extremal  processes  can  also 
be  constructed  by  some  sort  of  extension  of  the  partial  maxima  (or  records) 
from  the  original  sequence,  on  the  same  space.  Such  considerations  have  been 
recently  made  by  Deheuveis  ([1] , [2])  who  started  with  a  strong  approximation 
of  the  record  times  {U^},  which  he  then  extended  to  a  strong  approximation  of 


the  inverse  extremal  process,  and  finally  to  the  extremal  process  itself. 

We  shall  show  in  this  paper  that  also  a  'direct'  approach  is  possible,  con¬ 
structing  first  the  extremal  jump  times  from  the  given  record  times. 


This  also  allows  for  a  more  detailed  investigation  of  the  relationship 
between  the  jump  times  {t^}  of  the  extremal  process  and  the  record  times  {U  } 
completing  the  results  of  Resnick  ([7], [9]).  In  particular,  if  S  denotes  the 
number  of  surplus  points  in  the  T^-sequence  over  the  record  times,  counted  in 
the  interval  (1,*),  then  S  is  approximately  Poisson  distributed,  with  mean 

E(S)  =  1  -  C,  (1 

where  C=  .577  denotes  Euler's  constant.  An  estimation  for  the  distance  of  S 
and  the  approximating  Poisson  r.v.  in  terms  of  total  variation  is  also  given. 


4 


2.  Construction  of  the  extremal  jumps 


In  view  of  what  has  been  said  earlier,  it  is  easier  to  work  with  the  time- 


transformed  process  { E (e C ) ;  t  e  F}  since  then  the  corresponding  jump  times 


{log  -oo  <  n  <  00}  form  a  homogeneous  Poisson  process  with  unit  rate.  Further, 


by  the  general  structure  of  extremal  processes,  the  jump  times  {log  x^}  must  be 


a.s.  concentrated  in  the  random  set  u^=^(log(U^  -  1) ,  log  U^) .  In  fact,  in  our 


construction,  log  x^  e  (log(U^-l),  log  U^). 


Let,  for  real  numbers  a<b,  N(a,b)  denote  the  number  of  (log  x  )- points  in 


the  interval  (a,b).  As  a  simple  consequence  of  the  Poissonian  nature  of 


{log  x^;  n>l},  in  a  successful  construction,  the  random  variables 


=  N(iog(U^  -  1)  ,  log  lTk)  should  be  conditionally  independent  given  the  G-field 


A  =  A(L’^,LT2, . .  .  )  ,  following  a  (below)  truncated  Poisson  distribution  QO^),  say. 


with  parameter 


k  log<u  _ i> 

k 


(2.1) 


where 


QO.j)  = 


1  X 


.j  >  j  =  1,2,3,...,  (X  >  0). 
'  _  1  J  • 


(2.2) 


Further,  conditioned  on  A  and  the  number  A^,  the  location  of  the  points  in 


(log(U^-l),  log  U  )  should  be  distr ibutionally  the  same  as  that  of  an  ordered 


sample  of  a  population  distributed  uniformly  over  this  interval. 


4 OS  b 


Slo*  u* 


*°5  ai 


^ Cri  ^  ^4 


ZJ 


By  means  of  two  independent  i.i.d.  uniformly  U[0,l]-distributed  sequences 


{W  (i);  neK},  i=l,2  (which  can  independently  of  (X  }  be  defined  on  the 
n  n 

same  probability  space,  eventually  after  enlarging)  we  are  thus  able  to 
interpolate  the  given  record  times  by  extremal  jumps  in  the  following  way. 

Step  1.  Determination  of  number  of  jumps. 

Let  denote  the  CDF  of  the  truncated  Poisson  distribution  with 

Q(a) 

parameter  a > 0.  Define 


=  F 


Q(y 


(W.  (D),  k£  E, 


(2.3 


where  Ak=  log (y— — )  .  Ak 
k 

the  interval  (log(U  -  1) , 

K. 


describes  the  number  of  jump  times  to  be  implanted  in 
log  UR) . 


Step  2.  Determination  of  position  of  jumps. 
Let 

0,  k  =  0 

Bk  =  ' 

[Ax+  ...  +AR,  k>  1. 


Define  unordered  samples 


W_  (2) log(U,  -  1)  +  (1-W  (2) )log  U 

Bk-1  J  R  k-1  J 


(2.i 


for  1 <  j  <  Ak*  k  s  1.  Let 

106  15JSV  k21 

(ordered  samples) . 

Step  3.  Completion  of  the  sequence. 

Extend  the  sequence  {logxn;n^l}  to  the  whole  time  interval  (-"",“)  • 


This  can  be  done  in  different  ways;  one  possibility  is  to  construct  the  jump 

times  { log  T  ;  n < 0}  in  'reverse'  time  such  that  log T  <  0. 

n  u 

This  procedure  requires  at  most  another  countable  set  of  independent 
r.v.'s,  independent  from  the  previous  ones,  which  again  exist  on  the  same  pro¬ 
bability  space,  eventually  after  enlarging. 


3.  Extremal  jumps  and  record  times 

From  the  foregoing  it  is  clear  that  the  sequences  {logT  ;  n > 1}  and 

n 

{logU  ;  n >  1}  are  closely  related  since  log T  only  takes  values  in  the  set 
n  n 

00 

L^_^(log(U^  -  1) ,  log U  ) ;  particularly,  there  exists  some  a.s.  finite  r.v.  S 
such  that 

log T  e  (log(U  - 1) ,  log  U  )  a.s.  (3. 

n+b  n  n 

for  sufficiently  large  n  (cf.  [7], [9]).  From  here  it  follows  that 

log  U  =  log  T  Q+  o(exp{-n  +  nH(— )  })  (3. 

n  n+b  n 

=  log  +  O(logn)  a.s.  (n-*-00) 

where  tH(-^),  t>0  belongs  to  the  upper  class  of  a  Wiener  process  (see  [6]) 

since  log  -  log  (U^  -  1)  with  n->». 

n 

Relation  (3.2)  does  not  provide  the  best  possible  strong  approximation  of 

{ log  lM  by  a  homogeneous  Poisson  process  with  unit  rate.  In  fact,  in  [6]  it 

was  proved  that  there  exists  a  Poisson  process  {t  ;  n^l]  with  unit  rate,  de- 

n 

fined  on  the  same  probability  space  as  the  original  sequence,  and  a  r.v.  Z-0 

which  is  asymptotically  independent  of  this  process  such  that 

log  U  =  T  +  Z  +  o(exp{-n+ nH(— )  })  a.s.  (n  +  °°)  (3. 

n  n  n 


which  gives  an  a.s.  0(1)  rate  result.  It  was  also  shown  that  Z  can  be  repre¬ 
sented  as 


Z 


l  log(l  + 
k=l 


(3. 


where  {W  }  is  an  i.i.d.  sequence  of  U[0,l]-distributed  r.v.'s  independent  of 


K\-J 


\v-' 

V* 

"l.*  -.' 

TT 


r.y.j 


k^'rV 


E(S | A)  =  E (Z | A)  a. s . , 


(3.5) 


hence 


E(S)  =  E(Z)  =  1  -  C. 

Froof .  According  to  what  has  been  said  in  Section  2,  we  have 


U, 


E<W  •  V°*<urh>  a-s- 

k 


which  is  the  (conditional)  mean  of  the  truncated  Poisson  distribution  Q(\  ) 
Uk 

with  =  log(— — — ) .  But  a  little  analysis  shows  that  also 
k 


U  W 

Uk  l0g  (U~TT)  =  +  r^T)|V  + 1  a-s> 

k  k  lv 


(3.6) 


The  result  now  follows  by  the  observation  that  S  =  I  (A^  -  1) ;  hence 


E(S|A)  =  l  E(log(l  +  T—y)  I  A)  =  E(z|A)  a.s. 
k=l  k 


(3.7) 


In  the  light  of  (3.1),  S  is  the  surplus  number  of  points  in  (t  ;  n  i  1 


compared  with  {U  ;  n>l}. 


It  should  be  pointed  out  that  since  {log  T^}  has  i.i.d.  increments  following 


an  exponential  distribution  with  unit  mean,  the  limiting  distribution  of 


log  U  -logx  is  that  of  Z*  =  Z,  ,Y,  ,  where  {Y,  }  are  i.i.d.  exponential  random 
n  n  k=l  k  k  r 


variables  with  unit  mean,  independent  of  S,  such  that  again  E(Z*)  =  1  -  C. 
However,  Z*  and  Z  are  not  identical  in  distribution  since  P(S =  0)  >  C >  0; 
hence  Z*  has  an  atom  at  zero,  while  Z  has  no  atoms. 


Since  A  =  -  1)  and  P(A^>2)  is  "small"  one  might  expect  that  Z 

should  be  close  to  a  Poissonian  r.v.  This  can  be  precised  in  the  following 
way . 

Theorem  2.  Let  P('..)  denote  the  Poisson  distribution  with  mean  u  >  0.  Then 
with  u=l-C=  .422,  S  is  approximately  P(u)-distributed ,  with 

sup  I P(S  M)  -  P (u)  (M)  f  <  .12. 

M£2Z 

Proof .  According  to  Theorem  3  (Appendix),  if  d  denotes  the  total  variation 
X 

distance  and  P  the  distribution  of  a  r.v.  X,  we  have,  conditionally  on  A, 
with  -k  -  logC^ij), 

A 

d(P  k_1 (* |A) ,  P(  V2Xk>)  <  .06\j* 
since  > ,  ^  log (1  +•-)  <  log  2  for  all  k  ^ 1 . 

K  K 

A  little  calculus  shows  that  for  any  y>2,  we  have 

|  v  log  (-^-)  -  1  -  V2  log  |  <  ~  log"  ( 


hence  (cf.  [11]) 


d(P(V2\),  P(U ,  >.-!))<■ 

k  k  k 


and 


-l_i  2 
12  k 


d(PS(* | A) ,  P(E (Z  j  A) ) )  <  ±  l  \2  +  .06  J  >3 

k=l  k  k=l  k 


(3.11) 


by  (3.6)  and  (3.7).  From  here  it  follows  that,  if  E[P(E(zjA))]  denotes  the 


corresponding  mixed  Poisson  distribution,  that 


and  hence  by  [5] 


d(PS,E[P(E(Z|A))])  <yr  ^  E(>2)  +  .06  l  EC-3) 

1  k=l  k=l  k 


(3.12) 


oz  oo 

d(PS,P(E(Z)))  <  ■—  l  E(X2)  +  .06  l  E  ( '. 3 )  +  Var  (E(Z  !  A) )  (3.13) 


where  E(Z)  =  E  =  1  -  C.  Now,  for  y  >  2, 


,03136  log2  (^-)  <  jl-  (y-Dlog2^)  -  (y-l)2log2(^)!  <  ^  log2  Cy-) , 


hence 


Var (E(Z ) A) )  =  Var(Z)  -  E(Var(ZjA)) 


=  Var (Z)  -  I  {1  -  E[(U  -  1)C2]  -  E[(U  -  I)2),2]}  (3. ID 

k  k  k  k 


<  Var (Z)  -  .08136  £  E(A  ), 
k-1  k 


hence 


d(PS,P(h))  <  .002  l  E(A2)  +  .06  I  E(\3)  +  Var (Z) . 
k=l  k  k=l  k 


But  A^<log(l  +  — ),  and  Var(Z)  5  .09,  hence  the  result  follows  bv  some  numerical 


computations. 


Clearly,  by  the  a.s.  finiteness  of  S,  we  can  now  also  construct  the 


extremal  process  E(t)  interpolating  the  given  partial  maxima  sequence,  at 


least  for  t  >  x  where  T  =  inf{k|xo,.  e  (U  -  1  ,U, )}  is  also  a.s.  finite. 
S+T  '  b+k  k  k 


We  only  have  to  define 


v  v  >  v  v  y  •>  -  -'V'v'V  v'-  >  •  .'-/y'vV'  %  . 


Theorem  3.  Let  the  r.v.  X  have  a  truncated  Poisson  distribution  Q(A)  with 


X >  0,  and  let  Y  have  a  Poisson  distribution  P(v)  with  v  =  — .  Then,  if 

> 

0  2 

'  <  \  =  2 . 702  (the  root  of  sinh  (-y)  =  3  '■  q  )  *  we  l>ave 


d(PX_1,PY)  =  sup  jP(X-leM)  -  P(Y  c  M)  j 
Me 

=  — —  {2  sinh  (y)  -  \}{l  +  — }  (4.1) 

e  -  1 


which  in  turn  can  be  estimated  by 


^3  (2+/.)cosh(-|) 


48  A  . 

e  -  1 


(A. 2) 


Proof .  The  sup  in  the  total  variation  distance  is  attained  for  the  set 
M  =  {k  £  ZZ+}|P(X=  k+1)  <  P(Y  =  k) } ,  which  is  M  =  {0,1}  in  the  case  A<  A  . 
This  follows  from  the  fact  that 


P(Y=  k) 

if  and  only  if 

sinh(y) 


-v 

e 


> 


1  Ak+1 
e''  _  i  (k+1)! 


,k-l 

k+1 


A, 


k  >  0. 


By  our  assumption,  this  is  only  true  for  k=0,l,  which  gives  (4.1).  (4.2) 

follows  from  the  Taylor  expansion  for  sinh. 


I  3 
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